3574 results found.
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
50 000 sentences Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
Penn Treebank annotation manualLanguage Type:
Trilingual
Languages:
English German Portuguese
Availability:
From Owner
License:
None
Size:
10000000 Production Status:
Newly created-finished
Use:
Text Mining
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
900 sentences Production Status:
Newly created-in progress
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
?
Size:
? GByte Production Status:
Existing-used
Use:
Dialogue
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
NIST Agreement
Size:
23 document clusters and Pyramids Production Status:
Existing-used
Use:
Summarisation
Paper:
N/A
Documentation:
http://www-nlpir.nist.gov/projects/duc/data.htmlLanguage Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
352 texts Production Status:
Existing-used
Use:
Word Sense Disambiguation
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
From Owner
License:
<Not Specified>
Size:
747 MByteProduction Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
NoLanguage Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
Yes
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
2114 surface realizations Production Status:
Newly created-finished
Use:
Natural Language Generation
Paper:
N/A
Documentation:
Yes, English
Speech Transcript
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>




